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Description 

INTEGRATED CIRCUITS HAVING 
POST-SILICON ADJUSTMENT CONTROL 

Background of Invention 

[0001] This application is a continuation-in-part of copending 

application Serial No. 10/063,450, filed April 24, 2002, ti- 
tled "Data-Based Control of Integrated Circuits." 
Summary of Invention 

[0002] jhjs invention relates the use of data obtained by testing 
integrated circuit (IC) chips to control the performance of 
the IC chip. In particular, it relates to simulating the oper- 
ation of an integrated circuit chip and programming the 
reference data table of the chip with information based on 
data obtained from that simulation. 

[0003] integrated circuit (IC) chips are being required to have 

higher execution speeds and better performances. In or- 
der to meet such requirements, IC chips having a million 
or more transistors are placed on a die. This requires that 
narrower dimensions, such as 130nm or less, must be 



used. Manufacturing chips that precisely is difficult due to 
variations in threshold voltages, the resistance of wires, 
and the capacitance of lines. These variations are intro- 
duced during the deposition of layers, thermal annealing, 
and other processes. In the paper, "Impact of Die-to-Die 
and Within-Die Parameter Fluctuations on the Maximum 
Clock Frequency Distribution for Gigascale Integration," by 
Keith A. Bowman, IEEE Journal of Solid-State Circuits, Vol. 
37, No. 2, February, 2002, these parameter fluctuations 
are shown to influence the maximum operating frequency 
or power consumption distributions within a die or be- 
tween dies. This paper predicts that when 50 nm technol- 
ogy is employed, a generation of performance improve- 
ments may be lost due to systematic within-die fluctua- 
tions. 

[0004] Another difficulty is the power consumption problem. Mil- 
lion or gigascale integration requires a huge power con- 
sumption. Some microprocessors already require more 
than 50W per die, which necessitates the use of a cooling 
system, such as a fan. Even when a cooling system is 
used, the temperature of die will be above room tempera- 
ture, in some cases up to 100°C. But initially the die is at 
ambient temperature which, in the case of a car or outside 



usage, could be -20°C in the winter. 

[0005] Figure 1 of the paper, "A 4-GHz 130nm Address Genera- 
tion Unit With 32-bit Sparse-Tree Adder Core," by S. 
Mathew, et al., IEEE Journal of Solid-State Circuits, VOL. 
38, No. 5, May, 2003, gives a picture that shows the ther- 
mal distribution within die. In this picture there are hot- 
spots in the die. Thus, there are problems in IC chips with 
parameter fluctuations and extreme operating tempera- 
tures, as well as local parameter fluctuations and internal 
thermal distributions. 

[0006] The paper, "Effectiveness of Adaptive Supply Voltage and 
Body Bias for Reducing Impact of Parameter variations in 
Low Power and High Performance Microprocessors," by 
James W. Tschanz, etal., IEEE Journal of Solid-State Cir- 
cuits, VOL. 38, No. 5, May, 2003, shows that controlling 
the adaptive supply voltage and the body bias is effective 
in overcoming problems with those parameter variations. 

[0007] | n u.S. Patent No. 5,710,800, the supply voltage and clock 
rate are controlled using an internal control circuit. 

[0008] | n u.S. Patent No. 6,345,362, each functional unit in an IC 
chip has an independently controllable threshold voltage. 
The instructions to a chip are decoded to determine which 
functional units are needed to execute the instructions. 



The process speed of the units can then be adjusted to 
the optimal power level. A status table indicates the 
present power status of each of the functional units and a 
requirements table identifies the units required to execute 
a particular instruction. 

[0009] | n u.S. Patent No. 5,996,083, the rate of program execu- 
tion is controlled by software that changes the data bus 
width and the power latency values. The power consump- 
tion per second can be decreased by changing the data 
bus width from 64 bits to 32 bits, but this approximately 
doubles the program execution time. The execution time 
for each program is determined by the clock rate and the 
power latency, which is the time required to change the 
applied voltage or the status condition (e.g., standby or 
execute). The power latency time is needed to stabilize 
the applied power and the program execution and it adds 
to the total consumption of power and the total operation 
speed. Changes in data bus width and power latency are 
controlled by the instruction program. In U.S. Patent No. 
5,996,083, the power consumption or program execution 
speed is improved by changing the program using an in- 
ternal control circuit. 

[0010] | n u.S. Patent No. 6,484,265, software control technology 



by supply voltage, clock signal, and body bias for perfor- 
mance, power consumption, and temperature control is 
disclosed. 

[° 011 ] In those technologies the values the control circuit works 
with are predetermined for the chips. The control circuit 
becomes complicated when it covers various cases or var- 
ious parameter fluctuations. 
Brief Description of Drawings 

[0012] Figure 1 is a schematic diagram of an IC chip. 

[0013] Figure 2 is a schematic diagram of the control circuit 

shown in Figure 1 and a simulation tester. 
[0014] Figure 3 is a diagram illustrating a clock wave form and a 

delay in two clock lines between starting points Al and 

Bland end points A2 and B2, respectively. 
[0015] Figure 4 is a diagram showing one embodiment of a 

deskew method for adjusting the delay time in clock lines 

A and B. 

[0016] Figure 5 is a diagram illustrating the results of using the 

deskew method shown in Figure 4. 
[0017] Figure 6 is a diagram illustrating a voltage dividing circuit 

for a local voltage supply. 
[0018] Figure 7 is a diagram illustrating an embodiment for post- 



silicon adjusting a voltage dividing circuit. 
Detailed Description 

[0019] | n Figure 1, large scale integrated circuit (LSI) chip 1, ei- 
ther as one of many IC chips on a wafer or as an assem- 
bled individual IC chip, has a data input 2 and a data out- 
put 3. Chip 1 comprises numerous partitioned circuit 
blocks 4, each of which has an internal temperature mon- 
itor circuit, or some temperature-generating or tempera- 
ture-sensitive blocks have such monitors, but the temper- 
ature monitor circuit could also comprise a single circuit 
block 4. Control circuit block 5controls various parame- 
ters of circuit blocks 4, such as clock speed, operation 
clock speed, supply voltage, and body-bias voltage. 

[0020] | n Figure 2, signal input 6 from each circuit block 4 con- 
tains information such as whether a circuit block is oper- 
ating or is on stand-by, the temperature of the circuit 
block and, if it is operating, at what clock speed it is oper- 
ating and/or what program it is executing. Signal input 6 
goes to monitor circuit 7, which determines which circuit 
blocks will be used next and at what clock speeds they 
will be used. Monitor circuit 7 accesses programmable 
reference data table 8 for the supply voltage, body-bias 
voltage, or other conditions needed to attain the desired 



performance, such as clock speed. For each circuit block 
4, control circuit 9 sends the supply voltage to be used to 
programmable supply voltage generator 10, which then 
generates the programmed supply voltage; control circuit 
9 sends the clock speed to be used to programmed clock 
speed and programmed skew controller 11, which sets the 
clock speed to be used; control circuit 9 sends the body- 
bias voltage to be used to programmable threshold volt- 
age controller 12, which sets the programmed threshold 
voltage by controlling the body-bias voltage; and control 
circuit 9 sends the instructions to be used to the program 
controller 13, which changes the program of the instruc- 
tion sets, such as the data bus width, or changes each cir- 
cuit block configuration program set, such as the rotation 
of the same function circuit block. The programmable 
voltage generator 18 is part of power supply circuit 10, 
programmable deskew circuit 19 is part of clock speed 
and skew controller 11, and programmable voltage gener- 
ator 20 is part of threshold voltage controller 12. 
[0021] Programmable reference data table 8 contains informa- 
tion, such as clock speed, power usage, or operating volt- 
age range, that is required attain a desired performance. 
The following is an example of the type of information 



that can be placed in table 8 for various temperatures: 



Desired Clock Speed 
(GHz) at 110°C 


Conditions Required to Obtain Desired Clock Speed 


Supply Voltage 
'(V) 


Body-Bias Voltage (V) for 
an N-type MOS Transistor 


1.0 


3.0 


-0.4 


1.5 


3.2 


-0.3 


2.0 


3.4 


+0.2 


2.5 


3.6 


+0.4 



Because power is current times voltage, the power used by 
a circuit block can be controlled by controlling its supply 
voltage. The supply voltages and body-bias voltages in 



the programmable reference data table should be consid- 
ered to be target voltages and voltages slightly above or 
below the target voltages may be permitted by the control 
circuits. 

[0022] Table 8 may contain data for an entire chip or it may con- 
tain different data for different circuit blocks 4 on a chip. 
It could even contain different data for different transis- 
tors or different groups of transistors on a chip. This data 
is accessed by control circuit 9 so that circuit blocks 4 
(Figure 1) operate at the optimal speed and voltages at 
various temperatures. 

[0023] Previously, the values in reference data table 8 were de- 
termined by the design of the chip. That is, the clock 
speeds, supply voltages, and body bias voltages that the 
chip was designed to operate at were entered into refer- 
ence data table 8. However, as explained hereinabove, 
each chip is slightly different and the designed values may 
not be the optimal values for any particular chip. (See "Im- 
pact of Die-to-Die and Within-Die Parameter Fluctuations 
on the Maximum Clock Frequency Distribution for Gigas- 
cale Integration," by Keith A. Bowman, IEEE Journal of 
Solid-State Circuits, VOL 37, No. 2, February 2002.) 

[0024] Referring again to Figure 2, in this invention reference 



data table 8 is programmable and values based on mea- 
sured data are entered into table 8. Tester 14 tests chips 
to determine whether the chips are acceptable. The test 
can be performed when the chips are part of a wafer or 
when individual chips have been cut from a wafer, assem- 
bled, and are ready to be shipped, or at both times. The 
test is preferably performed while the chips are on a 
wafer, however, as that is more economical. 
[0025] Tester 14 has a simulation circuit 15, which provides one 
or more programs for the chip to execute, a control circuit 
16, which controls the clock speeds, supply voltages, 
body-bias voltages, and programs of instruction sets used 
in the test, and the programs to be executed by the chip. 
The test can be performed on the entire chip and data can 
be entered into table for the entire chip. Alternatively, the 
test can be perform on individual circuit blocks, or even 
on individual transistors or groups of transistors, and data 
can be entered into table 8 for each circuit block, transis- 
tor, or group of transistors. The test typically consists of 
determining whether the chip can successfully execute 
one or more programs at various clock speeds, supply 
voltages, body-bias voltages, and programs of instruction 
sets at various temperatures. Preferably, for use in bat- 



tery-powered computers, the chip is tested at 2 to 4 dif- 
ferent supply voltages between 1 and 4 volts, at 2 to 4 
different body-bias voltages between -0.5 and +0.5 volts, 
and at 2 to 4 different clock speeds between 1 and 300 
MHz at maximum and minimum desired temperatures. 
Preferably, for use in plug-in computers, the chip is tested 
at 2 to 4 different supply voltages between 1 and 4 volts, 
at 2 to 4 different body-bias voltages between -0.5 and 
+0.5 volts, and at 2 to 4 different clock speeds between 
0.2 and 2 GHz at maximum and minimum desired tem- 
peratures. Of course, as chips improve, the preferred test 
voltages may fall while clock speeds increase. The test 
may also be performed at other conditions, such as at dif- 
ferent temperatures, for example, between 0 and 110°C. 
Typically, the data obtained is the minimum supply volt- 
ages and minimum body-bias voltages used in the test at 
which the chip successfully executed a program at each 
clock speed tested. The data may also show that the op- 
erating voltage values should be different from the prede- 
termined voltage values, which may change the range of 
the operating voltages. 
[0026] Control circuit 16 also evaluates the results of the test 
and, based on the test results, selects values to be en- 



tered into table 8. Data obtained during testing may be 
"adjusted" to allow for a margin of error or for other rea- 
sons. Program circuit 17 then programs table 8 by enter- 
ing values based on the test data into it. The values can 
be permanently fixed in table 8 by, for example, the anti- 
fuse method, the non-volatile memory method, or by 
other methods. In the anti-fuse method (see U.S. Patent 
No. 5,324,681, herein incorporated by reference), a value 
is fixed in table 8 by burning out a "fuse" in the chip. U.S. 
Patent No. 6,319,773B1, herein incorporated by reference, 
describes the non-volatile memory method. Alternatively, 
the values can be tentatively entered into table 8 so that 
the chip can be retested and new values entered into the 
table at a later time. Tentative values may be useful to al- 
low for erroneous testing or changes in the chip or the re- 
quirements for its use. 
[0027] | n y e t another embodiment of this invention, circuit blocks 
4 that contain a MEMS (Micro Electro Mechanical Systems) 
are adjusted post-silicon. U.S. Patent No. 6,639,488 dis- 
closes a mechanical switch for RF (Radio Frequency). This 
kind of mechanical system using semiconductor technol- 
ogy is affected by processing variations. Therefore, its 
performance, like applied voltage or electrical specifica- 



tions for switching, is different for each die or wafer. 
Therefore, when a circuit block has a MEMS component, a 
post-silicon adjustment according to this invention can be 
made. 

[0028] off-chip programmable memory can be used when the 
memory of the reference data table is not adequate. For 
example, the internal memory capacity may be insufficient 
or the memory manufacturing process may be difficult to 
merge with other circuit manufacturing processes. There- 
fore, while in the hereinabove example all the circuits are 
implemented on one chip, some off-chip circuit blocks 
are also contemplated. 

[0029] | n another aspect of this invention, the supply circuit is 

adjusted post-silicon using measured values. The operat- 
ing circuit blocks are influenced by parameter fluctuations 
that affect operation voltage, operating clock speed, and 
temperature range. Also, the supply voltage circuits, sup- 
ply clock line delays, and temperature monitor circuits are 
influenced by parameter fluctuations, so those circuits can 
also benefit from post-silicon adjustment using measured 
values. 

[0030] Figure 3 shows the wave forms of the start points Al and 
Bl, and the end points A2 and B2, for the clock lines A 



and B in Figure 1. The delay time for the clock line A is Dl 
and the delay time for the clock line B is D2 for the end 
points A2 and B2, respectively. For an accurate operation, 
the same clock time delay is needed for each circuit block. 
U.S. Patent No. 6, 594,807, herein incorporated by refer- 
ence, discloses delay cell technology for minimizing this 
delay time. 

[0031] Figure 4 shows a typical DLL (Delay Locked Loop) Circuit. 
This circuit is shown in U.S. Patent No. 6,614,275, herein 
incorporated by reference. This circuit has a voltage con- 
trolled delay circuit which consists of plurality of delay el- 
ements whose delay time is controlled by the charge 
pump circuit bias. The phase detector circuit compares 
the clock in signal to the clock out signal and, if there is a 
phase difference between them, it sends an adjustment 
signal to the charge pump circuit. The charge pump cir- 
cuit then sends a bias signal to the voltage controlled de- 
lay circuit to adjust its phase D to the one clock delayed 
phase of the clock in C (Figure 5). One embodiment is to 
prepare many taps (e.g., connecting points such as A and 
B in Figure 4) for the plurality of delay elements in the 
voltage controlled delay circuit. Each tap outputs each de- 
layed clock between zero delays to one clock to the clock 



in. If there are time delays such as A2 and B2 in the deliv- 
ered clock to each circuit block, the DLL circuit in pro- 
grammable clock speed controller 19 of Figure 2 delays 
clock line A by D2 minus Dl (see Figure 5), using post- 
silicon technology, so that the delay time of clock lines A 
and B are adjusted to the maximum delay D2. For exam- 
ple, if there are 10 delay elements and 1 clock delay is 10 
nsec, this means that the time from C to D is lOnsec in 
Fig. 5, and that each delay element delays by 1 nsec. If the 
delay time of Dl is 3nsec and the delay time of D2 is 
7nsec. then Dl must delay 4 nsec. This means that the A2 

th 

clock line would select the 4 tap. This invention has an 
advantage in comparison to the above-described delay 
cell insertion technology (U.S. Patent No. 6,614,275) be- 
cause, whether each clock line has delay cell or not, this 
results in a different supply voltage or temperature de- 
pendence for each clock line. The instant invention, how- 
ever, uses the same DLL circuit for each clock line, so that 
each clock line has the same dependence. 
[0032] | n another embodiment, the local supply voltage genera- 
tor circuit is post silicon adjusted. In U.S. Patent No. 
6,366,156, the supply voltage circuit is shown in Figure 9 
and in Figure 1 of the U.S. Patent No. 6,429,726, the sup- 



ply voltage circuit is disclosed using a select circuit. Both 
circuits are for body bias circuits and use a fixed voltage 
divider circuit, which consists of serial connections of a 
MOS transistor. Figure 6 of the instant invention illustrates 
a typical voltage divider circuit. In this circuit, Vred should 
be two-fifths (i.e., 40%) of Vref, but because of parameter 
fluctuations, each serially connected transistor character 
will be different, so the actual value of Vred may not be 
exactly two-fifths of Vref. Figure 7 illustrates one example 
of a post-silicon adjustment of the local supply voltage 
circuit, where Vred is adjusted to desired value by the 
programmable select circuit. 
[0033] | n u.S. Patent No. 5,890,100, a temperature monitor cir- 
cuit is disclosed. This technology uses the delay time de- 
pendence of the inverter circuit to temperature (i.e., the 
delay time is usually a linear function of temperature, with 
higher temperatures having longer delay times). This de- 
lay time dependence to temperature is influenced by 
manufacturing parameter fluctuations. Other temperature 
monitor circuits are disclosed in U.S. Patent No. 6,630,754 
where bandgap reference voltage technology is used. This 
technology is substantially constant over a range of tem- 
perature and parameter fluctuations, but for sensing them 



a circuit analog to digital converter circuit is needed and 
this circuit is complicated and is influenced by parameter 
fluctuations. 

[0034] a temperature monitor circuit gathers data on tempera- 
ture variations for each circuit block and that information 
is used to control the operating voltage, body-bias volt- 
age, or operating clock speed of that circuit block in order 
to maintain a predetermined temperature range or to op- 
timize operating conditions for each temperature using a 
post-silicon adjustment. In one embodiment of this in- 
vention, measured data that shows temperature variations 
in the temperature monitor circuit for each circuit block 
are used to correct data in the reference table that is ac- 
cessed by the temperature monitor circuit. For example, 
in the testing process the chip's ambient temperature may 
be changed from -10 to +110°C. During that time the 
chip doesn't operate and therefore doesn't generate heat. 
The tester monitors the ambient temperature of the chip, 
compares this data (e.g., voltage values) with data from 
the temperature monitor circuit in the chip being tested, 
and enters the correct data into the reference table for the 
chip. Currently, the temperature monitor circuit data out- 
put is a voltage value at a particular temperature that is 



based on designed data, and that designed data is in- 
putted into the reference data table. But in this invention 
if, for example, the designed voltage value is 5mV at 
+ 110°C and the correct value at +110°C is 7mV, based on 
an actual test, then the correct voltage value at that tem- 
perature is entered into the reference data table. When the 
chip is allowing enough time to balance ambient tempera- 
ture, the internal chip temperature will be equal to the 
ambient temperature because the chip will not be gener- 
ating more heat than it loses. 
[0035] | n y e t another aspect of this invention, a post-silicon ad- 
justment is made to the circuit block configuration. As 
described before, Figure 1 of the paper, "A 4-CHz 130nm 
Address Generation Unit With 32-bit Sparse-Tree Adder 
Core," by S. Mathew, et al., IEEE Journal of Solid-State Cir- 
cuits, VOL. 38, No. 5, May, 2003, shows the thermal dis- 
tribution within a die. In this picture, there are hot-spots 
in the die. In order to avoid these hot spots in the die, a 
plurality of circuits that generate heat are used for the 
same function and the particular circuit used is changed 
by program controller circuit 13, which is controlled by 
control circuit 9 and monitor circuit 7, which monitors 
their temperatures, referring to data in programmable 



reference data table 8. 
[0036] Referring to Figure 1, circuit blocks of CI, C2, and C3 are 
the same circuit as the AGU (Address Generation Unit) or 
CPU (Central Processing Unit) core, which can create ther- 
mal hotspots. One embodiment of this invention is that in 
the testing process after fabricating the silicon wafers, the 
performance of each circuit is evaluated and prioritized. 
Then the reference data table is programmed according to 
which configuration of these circuits blocks is the most 
effective and creates the fewest thermal problems. Refer- 
ring to Figure 1, circuit blocks of Dl, D2, and D3 each 
have identical cache memory blocks, which is requested 
fast access time. In the testing process, the performance 
of each cache memory block of Dl, D2, and D3 is evalu- 
ated, including its access time and its temperature depen- 
dence. Then the reference data table is programmed to 
generate the most effective configuration for using those 
circuit blocks. 

[0037] when these cache memories are large, one or more of Dl, 
D2, and D3 may be set as off-chip dies. For example, the 
die including the control circuit and the programmable 
reference data table and Dl, D2, and D3 dies are attached 
the same printed circuit board. After assembling these 



dies on a printed circuit board, this board is tested and its 
data is entered into the reference data table. 
[0038] This invention is not limited to Si transistor or Si crystal 
wafer base integrated circuits, but can be applied to any 
integrated circuit, including analog circuits, SOI ( Silicon 
On Insulator), Si-Ge, GaAs transistors, TFT( Thin Film 
Transistor), or carbon nano tube transistor based inte- 
grated circuits. 



